FAST DRIFT AND DIFFUSION IN AN EXAMPLE OF ISOCHRONOUS SYSTEM 

THROUGH WINDOWS METHOD 



ALESSANDRO FORTUNATI 



(N 

o 

(N 
O 

Q 



Q 



> 
m 



ABSTRACT. We study the problem of Arnold's diffusion in an example of isochronous system by using 
a geometrical method known as Windows Method. Despite the simple features of this example, we show 
that the absence of an anisochrony term leads to several substantial difficulties in the application of the 
method, requiring some additional devices as non-equally spaced transition chains and variable windows. 
In this way we are able to obtain a set of fast orbits whose drifting time matches, up to a constant, the time 
obtained via variational methods. 



1. Introduction 

Arnold's Diffusion [Arn64] is a topological instability phenomenon arising in nearly integrable Hamil- 
tonian systems with more than two degrees of freedom. It consists on the existence of a class of motions 
whose effect is to produce a drift of slow variables of order one in a "very long" time T^, i.e. tending to 
infinity as the perturbation size fi approaches to zero, for all sufficiently small fj,. A quite relevant prob- 
lem clearly concerns the estimate of the minimal time in order to observe this phenomenon. The earlier 
literature on this topic, such as [CG94], [Bes96] etc., has shown that the attempts to give an estimate of 
Trf crucially depend on the method used to construct such class of trajectories, rather than the features of 
the system at hand, as it would be natural to expect. The question is addressed in [Gal97] and in [Gal99], 
by taking as a paradigmatic example the following simple a priori-unstable isochronous system 

V 2 

H(p, q,A,<p)=u-A + ^ + (cos q - 1) + fiF(q, <p), (1) 



w • A + — + (cos q — l) + fJ,F(q, f), 

with (q, (p,p,A) € A4 := T*(S X x T 2 ). Some relevant subsequent works on this class of systems (with 
some various generalizations) extent this gap further, as summarized in the following (non-exhaustive) 
table 



(N 
(N 



X 



Work 


T d 


approach 


[Gal97] 


super-exponential 


geometrical 


[Gal99] 


exponential (i.e. 0(exp(/i~ i ))) 


geometrical 


[CreOl] 


polynomial (i.e. O^ -7 )) 


geometrical 


[BB02] 


logarithmic (i.e. 0(fi~ L log fi~ L )) 


variational 



where 7 is a positive 0(1) constant. 

In the much more investigated anisochronous case, the situation was a little bit more encouraging. In par- 
ticular, the logarithmic estimate finally given by the variational approach of [BBB03], was also obtained 
in [CG03] via the so-called Windows Method (WM). This powerful geometrical tool (originally due to 
Aleskeev and Easton, see for instance [Eas81] and references therein), was reconsidered and developed 
in [Mar96] on a simplified version of the Arnold's example. Nowadays, the WM has been deeply im- 
proved leading to some advanced applications on the problem of Arnold's diffusion, see for instance, 
[GZ04], [GR07] and [ES12]. 
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The motivation of this work arises from the fact that a treatment of the isochronous case cannot be ob- 
tained neither from [Mar96] nor from [CG03] by taking as zero the anisochrony coefficient. The main 
reason can be briefly described as follows. As done in [Cre03], it is possible to show the existence of 
a conjugacy between the dynamics in a neighbourhood of a Graff torus and a symbolic dynamics on 
a suitable alphabet and then, in particular, the existence of periodic orbits. This allows to construct in 
[CG03], a chain of (hyperbolic) periodic orbits Ok, surrounding a transition chain of partially hyperbolic 
tori. Arnold's diffusion is obtained via WM as a shadowing of Ok- 

It is clear that the above described argument cannot be applied in the isochronous case as this class of 
systems does not admit periodic orbits: the (Diophantine) frequencies of rotators are fixed and then tra- 
jectories are open for all t, densely filling the underlying flat torus. Beyond the impossibility in adapting 
the construction done in [CG03], a deeper obstruction arises in the isochronous case: the absence of 
an anisochrony term inhibits a remarkable geometrical phenomenon pointed out in [CG01] and called 
"transversality-torsion". The latter is related to a suitable compressing-stretching action of the phase 
flow on windows, giving a key condition for a shadowing result (called correct alignment) in a quite 
natural way. This cannot be done in our case and the correct alignment condition requires a more careful 
treatment. 

In order to overcome this difficulty, we have considered a suitable sequence of "self deforming" (i.e. 
non-constant) windows, calling this mechanism simulated torsion. Such a sequence has been determined 
by solving a one dimensional discrete dynamical system over the transition chain. 
The more restrictive estimates (of technical nature) arising in the isochronous case, also obstruct the use 
of an equally spaced transition chain 1 . For this purpose, we have used the elastic chain tool, introduced 
for the first time in [Gal99] and consisting in a step-by-step variation of an equally spaced chain 2 . In this 
way, we are able to suitably move the hetero/homoclinic points, in order to bypass the ergodization time 3 . 
This leads to a systematic reduction of the transition time allowing us to obtain a logarithmic estimate 
for T d . 

2. Preliminary facts and main result 

2.1. System at hand, invariant tori and whiskers intersection. Let us consider the system (1) where 
the perturbing function is supposed of the form F(q, (p) = (1 — cos q)f(<p), with 

f{ ( £)= ^2 fkcos(k-ip), (2) 

l*il+|* 2 l<r 

T is a fixed positive constant, and f\ = > for all k. In particular / is an even trigonometric 
polynomial. We suppose u to be a (C, r)— Diophantine vector 4 . 

First of all note that the unperturbed system possesses a two parameter continuous family of invariant 
tori 

T(A) = {(p,q,A,cp): P = q = 0, ip € T 2 }, 

with i £ R 2 . Due to term 1 — cost/ in the perturbing function, the entire family survives also in the 
perturbed system. This is of course one of the main advantages of this simple example. If one requires 
that two tori with different actions B_, C lie on the same energy level, these have to satisfy u- (B_—C_) = 0. 
Hence we have 

B = C + 5u ± , (3) 



namely, a transition chain in which the distance in the actions space of two consecutive tori is constant. 
The use of elastic chains is a key ingredient for a remarkable reduction of the transition time in the geometrical methods 
context. A review on the differences between geometrical approaches of [CG94] and of [Eas81] and how these are able to 
improve the speed of diffusion, can be found in [Forl2]. 

' i.e. the time necessary to approach a given point on the flat torus within a prefixed distance. 
4 There exist constants C/fl and r > 1 such that \u ■ k\ > C|fc| _T . 
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for some 5. So they form a one parameter continuous family. 

These tori are whiskered in the sense that each of them admits a couple of invariant manifolds W S ' U (T(A)) 
called whiskers, such that, by denoting with <!>' the phase flow given by (1), limj^oo <&*(z) G T(A) for 
all z G W S (T(A)) and lim^-oo G T(A) for all z G W U (T(A)). 

As can be easily seen by equations of motion, the whiskers (positive branches) W u {T(B_) ) and W s (T(C) ) 
possess the following structure in the unperturbed system 



W U (T(B)) = {(q,<p,p,A) : Po (q) := y/2(l - cosq), A = B, (q, cp) G (0, 2vr) x T 2 } 

W S {T{C)) = {(q,<p,p,A) : po(q) := y/2(l - cos?), A = C, (q, cp) G (0, 2ir) x T 2 } 

It is clear that these whiskers intersect each other if, and only if, 5 = and moreover the intersection is 
flat, meaning that the whiskers tangent spaces coincide at every point. Note that the whiskers possess the 
structure (trivial in the A) of graphs over the angles. 

By continuous dependence on parameters, the whiskers will keep the graph structure also in the perturbed 
case, for sufficiently small p.. So, for all positive 0(1) constants d, the whiskers can be written as 

W U (T(B)) = {(q,ip, Po (q) + W u (q,tp,fJ,), B + »A U (q , <p, fx)) , (q , tp) G (0, 2vr - d) x T 2 } 

W S (T(C)) = {(q,tp,p (q)+W s (q,V,»), C + ^A s (q,ip,fi)),(q,ip) € (d, 2tt) x T 2 } 

see e.g. [Gal93], also for its recursive construction at all orders. 
Let us denote with 



(4) 



Ap :=p u (q,^,fi)-p s (q,ip,n), AA := A s (q, <p, fi) - A u (q, tp, ft), 

and with 

F(q, <p,6):= dor 1 + fiAA(q, <p, p). (5) 

Now define 

d(Ap, AA) 
M :=VF= K —p — J . 

d(q,(p) 

It is an easy consequence of the isoenergy constraint between T(B_) and T(C_) that if there exists a point 
(q*,(p*) G (d,2vr - d) x T 2 such that F(q*,ip*, 5) = andrkM = 2, then 5 W U (T{B)) rtl W S (T(C)) 
at (q*,ip*). Actually, the condition rkM = 2 is maximal as by the conservation of energy one gets 

d q Ap = au-dqAA 
d Vk Ap = au-O^AA, k = 1,2. {0> 

with a := -l/(p (q*) + ftp 8 (q* ,(£_*, fi)), so rkM < 3. 

In this case we shall speak, as usual, of a transversal homoclinic point if 5 = (i.e. B_ = C_) and of a 
transversal heteroclinic point otherwise. 

Another key ingredient is the symplectic character of the whiskers 6 . By its use we can state the following 

Proposition 2.1. Suppose that, by defining as M se the south-east 2x2 submatrix ofM, det M se (q* ,</?*)> 
holds for some (q*,(p*). Then we have 

Proof. (Sketch) Due to their lagrangian nature the whiskers are gradient of two respective (family of) 
generating functions S u,s {q, <p, \i) so M is nothing but the Hessian matrix of S u (q, tp, p) — S s {q, ip, fi), 
computed at the intersection point. This implies that M is a symmetric matrix. By solving the system 
given by conditions (6) with respect to the entries of M se and imposing that det Mse > the statement 
easily follows. □ 



rtl denotes the transversal intersection. 
6 see [LMS99 
at all orders in /i 



6 see [LMS99] for a comprehensive treatment of this topic and a different algorithmical approach for the whiskers building 
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Now we are going to assume the following typical 7 

Condition 2.2 (splitting). The system (1) possesses a transversal homoclinic point at q = tt. In other 
terms there exists a point (q,<p) = (n,(p°) such that F(tt,p°) = and det M S e(7r, ip°) > with 
(Af se )« = 0(1). 

Remark 2.3. Clearly (tt, ip ) is a particular homoclinic point. As the whiskers intersection takes place 
along an entire orbit, all the points of the form (q, (p° +cjT 9 _ Mr ), where T q -^ is the time to evolve from tt 
to q, are homoclinic as well. If q is 0(1) bounded away from the origin, M se can be well approximated 
via Melnikov integral. It can be easily seen that M S e does not change (up to 0(p 2 )) if different section 
for q are chosen. 

The section q = tt is useful in presence of symmetries of the perturbing function as in this case. It is 
possible to show (see e.g. [GGM98]) that, by parity, (q, (p°) = (tt, 0) is a transversal homoclinic point at 
all orders in fi. We denote as ipP =: (ip^ip®). 
In the current setting, we state the following 

Lemma 2.4. Let us assume the splitting condition, and define the section 

£ := {ip G T 2 : ip 2 = p° 2 }. 

There exist 8 = 0(/i) and two functions q, <p\ defined on (—8, 8), with q(0) = tt and p>i(0) = ip®, such 
that, for all B_ and sufficiently small p, 

W U (T(B)) rh W S (T(B + <5w ± )) J 
on S at the point (q(8), ipi(8)). In particular 

for all 8 G (-8,8). 

The just stated version of this classical splitting result is a sort of counterpart of the statement which 
can be found in [Gal99]. According to the latter, as 8 varies, the ip coordinates of the intersection point 
move along a curve on T 2 , while q is fixed to tt. In our case q moves, but ip 2 is fix, allowing us to use a 
single section on T 2 for all values of 8 and then to perform an advantageous dimensional reduction. 

Proof. By hypothesis F_(tt, <p°, 0) = and rkM se = 2, so by implicit function theorem, there exist 8± 
and two functions q(8), (pi(8) such that ((/(0), <£>i(0)) = (tt, <£>?) and 

F(q{S),<pi(S),<pl8) =0, Vff €(-*!,*!), 

so (q(8), <pi(8), (p®) is a transversal hetero/homoclinic point. Note that, as all the entries of M are O(p), 
then 8 is O(p). 

Defined D := -[dq&Atf^ AA 2 - dqAA 2 d tpi AA 1 ]- 1 , we get 

d{q,Vi) ( ,, _-if d q AA x 8 tpi AA l VV d$F\ \_ _ x ( -d^Ap \ 
dS \d q AA 2 d Vl AA 2 ) \d & F 2 )-^ U \ d q Ap ) 

In which we have used (7b) and formulae (6). Property (8b) is a direct consequence. 

By (7a) and by continuity, there exists 8 2 > such that d q Ap ^ for all 8 G (— 8 2 , 8 2 ). By choosing 

8 := min{^i, ^2} ? (8a) immediately follows. □ 



the existence of a transversal homoclinic point is a standard result for this system; see e.g. [GGM98] for the same result in a 
more general context. Condition det A/se > (instead of det Afse 7^ 0) is a feature of this system and can be straightforwardly 
checked from the Melnikov integral (for sufficiently small n). 
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2.2. Transition chains. 

Definition 2.5. A transition chain of length A for (1) is a (finite) sequence of tori Tk '■= T(A k ) such 
that 

W u (T k ) rh W s (T k+1 ), Vfc = l,...,JV-l. 
By (3) the sequence {A k } is necessarily of the form Afy^), y k e R, where 

A(y) :=A° + u^-y. 

Set 5 k := y k+1 — y k , by lemma 2.4 every sequence Tk such that |<5 fc | < 5 for all A; = 1, . . . , N — 1 is a 
transition chain. 

Definition 2.6. We call equally spaced chain (ESC) a transition chain Tk such that 

5 k = {-5,0,5} 

for some fixed 5 < |<5. 

Now consider the sequence of intervals 

£ fc := (y k -C lf i,y k + C lfJ ), 
with Ci suitably chosen in a way that T(A(y k )) is still a transition chain 8 for all y k € Ek and for all /c. 

Definition 2.7. Let 7^* an ESC and let be its associate sequence of intervals. An equivalent elastic 

chain (EEC) to the given ESC, is a sequence Tk such that y k € E£ for all k. 

In this setting we have the following 

Theorem 2.8. Let N = 0([i~ l ) and sufficiently small fi. For all ESC of length N there exists an EEC of 
the same length and a set of initial data U such that the solution of (1) starting in U shadows 9 the EEC 
with a time 

r = o(-ln-V (9) 

As an immediate consequence we get 

Corollary 2.9 (Arnold's Diffusion). The Arnold's Diffusion takes place in the system (1) on a set of 
initial data lAd <ZU with a time given by (9) 

Proof. Choose the EEC given by 5 k := 5 (or —5) for all k and apply theorem 2.8. As 5 k = O(p), an 
0(/U _1 ) number of transitions leads to a drift of O(l). □ 

Remark 2.10. As the set of all ESC of length N has cardinality equal to 3^, the drift theorem 2.8 
shows that the system ( 1 ) possesses a rich behaviour. In particular, every step on the transition chain is 
conjugate to a random walk on a ternary tree. Arnold's diffusion is simply associated to the two lateral 
branches of the entire tree. 

3. Flow approximation and windows 

This section is devoted to fix the background material in order to prove the theorem. First of all we are 
going to construct a suitable approximation (in the "linear map + remainder" form) of the Hamiltonian 
flow close to a generic sequence of hetero/homoclinic points associated with a given transition chain. 
This is obtained in a standard way, that is, by a composition of two maps: the first one describes the 
"inner" motion close to the invariant tori and the second one the "outer" motion from one torus to another. 
As a difference from [Mar96] and related works, we use Gallavotti's normal form instead of the Graff- 
Treshev one, giving a different criterion to determine the Jacobian matrix of the outer map. We also 



It is sufficient to choose Ci < S/(16fi). 

'we do not need here a precise definition of the concept of shadowing as it will be intrinsically given by using windows. 
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recall some basic notion on windows and their main properties for the reader convenience. For a more 
efficient comparison with our result, we shall follow notations of [Mar96] and of [CG03] very closely 
from now on. 

3.1. Inner dynamics: normal form. Let T(A*) be an invariant torus for the system (1). Fixed R, R, 
let us consider the phase space region defined as 

U krK (A*)-={^^P^):\p\,\q\<R, \A-A*\<ph, <p G T 2 }, 
that can be regarded as a "neighbourhood" of T(A*). 

Theorem 3.1 (Gallavotti, 1997). For all A* and [i small enough, there exist R, k, R, k independent on 
[i and a canonical map 

C : (q,<P,P,A) G U k ~ K {A*) (Q,±,P,D G U R , K (I*) 

with 

U R , K (I*):={(Q,±,P,l):\P\,\Q\<R, \L-L*\<^ V^GT 2 }, 
and (0, tp, 0, J*) = C(0, tp, 0, A*), casting the Hamiltonian (I) in the following form 

H(Q,±,P,r)=uL-I + J li (ri). (10) 
with rj := PQ and where Jfj,(r]) is an analytic function on Ur )K x (0, p,o)for some p,® > 0. 
In the new variables set, the system is clearly integrable and its evolution is given by 

(Q(t),P(t),±(t),I(t)) = (Q(to)e^\P(t )e-^\l(t )+ij(t-t ),l(to)) (11) 
where g{rf) := J M (??) and g(0) =: g. 

The proof of this result can be found in [Gal97], and goes along the lines of the (more general) anisochronous 
case proof of [CG94]. In order to clarify some consideration we shall need later, let us recall that the 
canonical map C is the composition of two maps: the Jacobi map Cj 

QJ = i^ P \ (12) 

pj = R{q,p) 

(straightening the invariant manifolds of the unperturbed pendulum) and a KAM map Ck (a 4 close to the 
identity) 

qj = Q + ^L(Q,^P,p) 

V = ± (13) 
Pj = P + pM(Q,±,P,p) {Li> 

A = 1 + fiN(Q,^,P,fi) 
whose effect is to remove the perturbation at all orders. 



3.2. Reduced system of coordinates. In the neighbourhood Ur jK (I_*), is useful to define a reduced 
system of coordinates. This standard approach is systematically used in such cases (see, e.g. in [Eas81] 
or [Mar96]) and can be obtained as follows 

(1) Fix a Poincare section on T 2 : due to isochrony, every straight line on T 2 whose direction is not 
aligned with lo, is a transversal section. The simplest choice is 

S^v := {^T 2 : ^ 2 = ^}, 

for a complete interface with the "external" section S. In such a way we get a solutions dis- 
cretization, and the sampling time t* is constant because of isochrony. Without loss of generality, 
we shall suppose t* to be equal to 1. 



this result holds with a more general perturbing function, in particular, it does not require the presence of the term 1 — cos q. 
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As u is a Diophantine vector, the recurrence frequency on Sjy, denoted as v, is a (C, r) — Diophantine 
number. 

(2) Set an energy level A4(h) := {(P, Q,i/j,f) : H = h} and suppress the action I2 by restricting 
to M(h). 

By denning 9 := ipi and p := /1, the coordinate system of the restriction to V h (p*) := Ur jK (I_*) n 
Stv n ,M(/t) is simply (Q,9, P, p). By taking into account of (11), the images of the section map 
/ : V h {p* ) ->■ V h {p* ), easily write as 

r(Qo,e ,Po,po) := (Qoe^)^ + w ,p oe -^),p ), (14) 

where 77 = Q0P0. If we write 

1 + [ ±nrjg(x) + -(nr]g(r])) 2 ± . . . 



e ±ng(r)) _ e ±ng(0) 



:L ±n [l+F ± ( V )], 



2 

the map (14) can be expanded in a neighbourhood of X fc in the following way 

l n (Xk + = {L n {Qk + 6), k + 6 + nu, L~ n ^p k + £4) + R(X k ,0, (15) 

with 

= (L"(Q fc+1 +6)^(r/),0,L-^3i ? +(^),0), (16) 
where in such case rj = (Qk+i + £i)£3- We also denote with / n the linear part / n — R. 

3.3. Sequences of hetero/homoclinic points associated to ESCs and EECs. By lemma 2.4 we can 
associate with an ESC 7fc a sequence of hetero/homoclinic points (q k ,tp k ) := (q(S k ), <Pi(5k)) on S. 
We denote with I G T, the sampling of the phase flow on E. 
Let A; be arbitrarily chosen and let 

z k := (q k , <p k , cp° 2 ,Po(q k ) + pp u (q k , (p$, <p%, p),A(y k ) + pA u (q k ,<p k ,$, p)). 
By construction z k e M (1 M(h) nT,. 

As lim^.oo &(z k ) = T k and limj_ H _ 00 &(z k ) = T k +i, for a sufficiently large |l| we get ®~ l (z k ) G t/ fc 
and (z k ) G £4+1- In this way, these points can be written in the respective normal coordinates of the 
tori. In particular, there exist l k G N such that 11 

(C o ( lfc )) Q < i?/2, (C o (^)) p < 
In the reduced system these points reads as 

X k = (Q k ,9 k ,0,p k ) := (Co^(z k ))\ Vk 

x! k = (o,e' k ,p> k , Pk+1 ) ■.= (Co¥i(z k ))\ Vk+1 - 

Let us consider an ESC, for all EECs associated with the given ESC, by following the previous construc- 
tion we get a double sequence of hetero/homoclinic points {X k {y k ), Xl(y k )} k= i . f° r ai l Vk £ E k . 
If necessary, we could reduce the range of variation 6 (i.e. C\) in such a way Q k (y),P k (y) < (3/4) i? 
for all y E E k . 

The key point is that, due to isochrony, by varying y in E k (of an O(p)), we can move the coordinates of 
the hetero/homoclinic point (of an O(l), see (8b)) on T 2 without ever falling out of S^r, i.e. by functions 

6 k (y):= Vl (5)-l k u, 9' k (y):=<p 1 (5)+l+v, (18) 

where 5 := y k+l - y with y k+1 fixed in E k+1 . 

This is the advantage in our formulation of lemma 2.4 with respect to the setting used in [Gal99]. As £jv 
is one dimensional the problem of finding y such that e.g. 9 k {y) is equal to some given 9* in the image 
of 9 k (y) is achievable in a constructive way 12 . 



1 denote e.g. with (v) vi the Vi component of v. 
compare with comments in [Gal99, pag. 307 and 309]. 
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As a direct consequence of the normal form, the manifolds W u (Tk) and W s (7fc+i) in V k and V k+ i 
respectively, are given by 

W u {%) = {(Q, 9, P, p) : P = 0, p = p k }, W s (T k+1 ) = {(Q, 9, P, p) : Q = 0, p = Pk+1 }. 

As l k = 0(1) (see remark 2.3), by writing down the expression of the whisker W s {T k +i) in terms of 
normal coordinates 13 , it is possible to recover (up to higher orders) two smooth functions P k+ i(Q, 9; y) 
and p k +\(Q, 9; y) such that the connected component of W s (T k +\) intersecting W u (7~ k ) at X_ k {y) in V k 
is parameterized by 

Y k+1 (Q, 0; y) := {(Q, 9, P k+1 (Q, 9; y),p k+ i(Q, 9; y))}, Y k+1 (y)(Q k ,9 k ) = X k (y). (19) 
Furthermore, the normal splitting matrix 

M N :=(Yn +1 d f n k+1 )(Q^^y) (20) 

V VQPk+i OePk+i J 

satisfies 

(Mjv)h ± 0, rkM N = 2, (21) 

by virtue of (7) for all y G E k (if necessary we could reduce C\ further). As a geometrical consequence, 
we have that W s (T k +i) is still a graph over the angles in the normal (reduced) coordinates system. 

3.4. Outer dynamics: transition map. As we have written by /" the evolution in V k , we want to give, 
here, an approximation of the Hamiltonian flow from V k to V k+ \. For this purpose, let \P : V k — > V k+ \ 
the Hamiltonian evolution implicitly defined by 

MX k )=r k - (22) 

By regular dependence on initial data, there exist neighbourhoods V k of X_ k and V' k of X/ k respectively, 
such that maps V k into V' k and, for all X_ k + £ € V k can be written in the form 

MK k + = X' k + A£ + R(K k ,0- C 23 ) 

We have denoted by A := T>^_{X_ k ) the Jacobian matrix of the map computed at the hetero/homoclinic 
point and by R(X k , £) the remainder that in terms of components reads as 

R\X k ,0= I (l-t)CH¥(X k + tO-Z T dt, i = l,...,4 (24) 
J o 

% denotes the Hessian matrix operator. We denote with the linear part if — R. 
The map also known as separatrix map, see e.g. [Tre02], is a standard and widely used tool to 
construct trajectories close to invariant manifolds. The approach we are going to use, in order to describe 
the structure of A, is of geometrical nature and it seems to be very close to the one used in [Tre02]. The 
matrix A is essential for a non-trivial approximation of the flow, however it is not difficult to understand 
that this matrix conceals a complicate structure in our case. For instance, let us denote with <&1 the 
Hamiltonian flow (in global coordinates) mapping V k in V' k and C k ^ k+l the canonical map C with the 
coordinates reduction to V k and V k+ \ respectively. So, by construction we have 

MQ, 9, P, p) = {C k R +l o <D' fc o {C h R y x ){Q, 9, P, p). 

As one can easily deduce by (12) and (13), this computation turns out to be a very difficult task 14 , even 
at the first order in p, unless one would like to resort to a numerical approach. 
We state the following 



so backward evolved by a time l k . 
14 the main difficulty arises if one write down the perturbing function in terms of Jacobi's coordinates and then attempt to 
perform a perturbative step. 
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Proposition 3.2. The matrix A for the system (1) takes the form 



A 



( 



V 



o 



-l/a 




a \ 





1/ 



/ an 


«31 
\ a 4 i 



an 




«13 





Cl32 033 

042 043 








(25) 



with a / 0, cotg? 15 fly := a^ (/x). These satisfies, for sufficiently small p 

au / 0, D := aiia 42 - 041012 / 0. (26) 

Note that the above structure holds at all orders in p. We denote with Aq the unperturbed part of A. 

Proof. If p = the system is integrable, then rotators and pendulum are uncoupled. Moreover, the 
canonical map C reduces to the Jacobi's map for the pendulum, whose Hamiltonian takes simply the 
form J (PQ). 

Let IT the projection operator on the (P, Q) plane. By conservation of energy, for (P, Q) € HV k and 
(P', Q') € nVfc + i we get Jq(PQ) = Jq(P'Q'). Now consider a small increment of (Qk, 0) of the form 
(Qk + 0", S) by denoting with (f(a, 5), P' k +g(cr, 5)) corresponding variations in ITVfc+i- It is understood 
that f(5, a) and g(5, a) are regular functions by definition of and such that /(0, 0) = g(0, 0) = 0. 
Denote with the entries of §^fy (0, 0). By conservation of energy, turns out to be, up to higher orders 
in (5, a), 

UllU 2 l = 0, 1il2^22 = 0, UnU22 + U21U12 = 1. 

Now recall that the intersection between W£ and W% +1 is flat, then increments in direction (Q, 0) in II Vk 
imply displacements in direction (0, — P) in III4+1, from which U21 < 0. So, by the previous equalities 
we get un = 0, so U12 = l/«2i and then U22 = 0. It is sufficient to define a := —U21 and note that 
the Jacobian matrix in the (9, p) variables is trivially the identity matrix, to get the required form of the 
unperturbed A. 

Now suppose [i ^ 0. By definition, if is restricted on the graphs of the functions Pk+i (Q, 9) , Pk+i (Q, 9) 
(parameterizing W% +1 in Vk), its image has to produce points on W% +1 in Vk+i and then such that Q = 
and p = pk+i- In other terms, for sufficiently small there exists f > such that, for all (a, e) € Bf (0) 



^1 (Qk + cr,9 k + e, P k+ i (Q k + a,9 k + e), p k+ i (Qk + cr,9 k + e)) 
^i(Qk + cr,9 k + e, P k+ i(Q k + a,9 k + e),p k +i(Qk + cr,9 k + e)) 
By expanding the first equation in a neighbourhood of (a, e) = 0, we get 





Pk+l 



an +«i3-g^ i + ai4 — 



ai2 + ax 3 



OP, 



fc+1 



+ au 



dQ 

OPk+l 



(27) 



(28) 



where we have recognized the elements of A. By the unperturbed form of A we know a>ij = p,ah +o(p) 
for j = 1,2,4 while 013 = a + ua} 3 + o{p). On the other hand, the derivatives appearing in the previous 
equation are exactly those of the splitting matrix (20). Then, by denoting with 

/ , , \ / 9P k+ i dPk+i 

I tlM til'?. \ I ~. 

A 1 



hn 

fl21 



hi2 

h-22 



dQ 
dpk+i 
dQ 



dpk-i 



and equating powers of « in (28) we get a n = —ahn and a\ 2 = — a/121 - First of all we note that an ^ 
for u ^ by (21a). In similar way, the second equation of (27) yields a^ = —h^i and a\ 2 = —^22- 
Then, by keeping in mind the definition of D, follows 

D = a(h u h 2 2 - h 12 h 2 i), 



15 In spite of the dependence also on k (as A is the Jacobian matrix computed at X_ k ), we suppress the index k to avoid a 
cumbersome notation. 
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that is, up to higher orders, the determinant of Mat: the latter is non-zero by (21b). 

Note that the argument is until now quite general, and can be reasonably extended to more general 

systems. 

Now we attempt to get some additional information by using the very simple form of our system, in order 
to simplify, as much as possible, the structure of the perturbed part of A. 

Let us consider two matrices .61,-62 £ M(n, m) for some n, m € N. We say that B\ and B2 possess the 
same structure, and we denote this property with B\ _6 2 , if B\ and B2 have null entries exactly in the 
same position(s). 

So, by (12) and (13) the Jacobian matrices of the canonical maps satisfy 



DCj = 



( Sg 




Rq 


V 





1 








Sp 



R p 





\ 






1 / 



DC K = I + 



( L Q 


Lip! 


Llp 2 


L P 





\ 






































Mq 






M P 








Nh 
\4 


4 


% 

4 


N P 
N P 








) 



where I is the identity matrix. Both matrices are computed, by definition, at the hetero/homoclinic point 
and the entries of the second one are functions of \i. 

First of all is clear that DCj DCJ . Furthermore, it is easy to check from (13) that DCk ~ DC 



K 



Now, simply by looking at the equations of motion, we can see that the pendulum evolution does not 
depends on the action variables at all orders in /i. This implies that the actions evolution (also due to 
the independence of the perturbing function on A) is independent on A at 0(/U fc ) for all k > 1. So the 
Jacobian matrix of the flow satisfies 



( 


X 








X 





\ 




( 


X 


X 


X 


X 





\ 







1 












































1 











+ n 
























X 








X 










X 


X 


X 


X 






















1 









X 


X 


X 


X 








\ 

















1 ) 




\ 


X 


X 


X 


X 





Q ) 



where the entries of the second matrix are functions of /i. 
In conclusion, by definition, we have that A := DCk ■ DCj ■ D& ■ DCJ 1 ■ DC 
to dimension 4 we get the perturbed part of (25) . 



K 



D& 



By reducing 

□ 



Remark 3.3. Clearly, the previous argument does not guarantee that the entries of A marked by x are 
non-zero since some of them could vanish due to some symmetry of the product. As we shall see, the 
only key properties we shall need are given by (26): it does not matter if the other entries vanish or not. 
In this sense we agree with the topological criterion used in [Mar96, p. 239] to give the structure of his 
matrix A. 



3.5. Windows. 

Definition 3.4. Let M C M d , and d h ,d v G N such that d h +d v = d. let L h := [-1, l] dh , L % 

and Q := Lh x L v . 

A window is a C 1 diffeomorphism 



-1,1]' 



W:Q^M. 
We denote with W the image 16 of Q through W_. 

The horizontals of W are defined as the images of W(xh, x v ) where x v £ L v is kept fixed and varies 
in L/j. Similarly, the verticals of W are defined as the images of W(xh< x„) with Xh € fixed and 
variable x v € L v . 



16 



we shall refer as widow either the map or its image. 
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The point W_(0) is called centre of the window. We shall refer to non-degenerate windows as windows 
containing a non-zero volume. Even if a non-necessary condition, it is natural to think the integers dh 
and d v as satisfying dh = d v = d/2. Actually, in the applications, d is the (even) dimension of the 
(eventually reduced) phase space. 

An interesting case is represented by affine windows. This class of windows arises when the function W 
is linear, and then can be written in the form 

W a (x) = c + Wx. 

The point c is the centre and W is a d x d matrix we shall call representative matrix of W a . Clearly, a 
window is non-degenerate if det W ^ 0. 

Definition 3.5. We say that a window W_ x is correctly aligned on (c.a.o.) W_ 2 if for all y v G L v and for 

all yh G Lh there exists unique Xh G (—1, l) dh and x v G (—1, 1)^ such that 

K 1 (x h ,y v ) = W 2 {y h ,x v ), (29) 

and the intersection is transversal. 

The notion of correct alignment leads to the following key result 

Theorem 3.6 (Shadowing, Easton). Let \T>k}k=i ... N ^> e a collection of open sets ofR d , ^ k : 2\. — > 
T>k+i a family of diffeomorphisms and W f a family of windows W_ k : Q — > T> k . Let, for all k, W h := 
*fc o W k . 

If W'j,, c.a.o. Wf. 1 1 for all k = 1, . . . , N — 1 there exists (at least) a point x* ofW\ C T>\ such that 
^at_i o . . . o ^(x*) G W N C V N . 

For the proof, given in a more general context, we refer to [EM79]. 



3.6. A correct alignment criterion. The following statement, due to J.P. Marco, give us a set of suffi- 
cient conditions in order to establish if a window is correctly aligned on another one. First of all, some 
notations: 

Let M(n, m) := GL(R, n x m) be the linear group of the n x m matrices with real entries. For 
all v G R d let II tjjjoo :— maXj = i r .. jC j \vj\. Consequently, for all A G M(n,m) we denote H^Hoo :— 

Em i i 
j=l \ a ij\- 

Now, let F(x) G C k {U,R n ), k > 1. The standard C 1 norm over U is denoted as 

\\F{x)\\ c i {u) :=max{sup||£(x)|| 00 ,sup||i?£(s)|| 00 }. (30) 

Lemma 3.7 (Marco, 1996). Let 

Wf = c x + Wix, W_l = c 2 + W 2 x, 
be two affine windows, where c 1 2 G M. d , and W\^ 2 G M(d, d) are of the form 

w _ ( W i W i \ 

Wi -{wi Wf)' 

with W} G M(d h , d h ), Wi G M(4, d v ), Wf G M(d v ,d h ) and W? G M(d v ,d v ). 
Now define the "intermediary matrices" 

(wl -wi \ ( -wl wf \ 

y W 2 _ W 4 ) , * - y _ W 2 W 4 j , 

and 

X a := supllM-^x + M- 1 ^ -cJHoo. (31) 

26 G 
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Suppose that M is invertible and \ a < 1- Then W \ c.a.o. W o- 
Now consider the following windows 

W^x) = W\{x) + W^x), W 2 (x) = W%(x) + W 2 (x), (32) 

where the maps W_ i (x) 6 C 1 (U,M. d ), andU is an open set containing Q. And define 

x c ■■=\\M-\w 2 (x)-W 1 (x))\\cHuy 

Suppose that 

cl. x a < 1> implying that Wf c.a.o. W %, 
c2. 

X c <j, x a + T^<l- (33) 
4 1 - x c 

77j<?« c.a.o. W_ 2 - 

As in [Mar96] it is natural to choose U := (—2, 2) d . A detailed proof of lemma 3.7 can be found in the 
already mentioned paper or, more closely to the above formulation, in [LM05]. 

3.7. A suitable class of windows. Our aim is now to construct a prototype of non-degenerate affine 
windows for the problem at hand. Following [Eas81] we choose a class of windows satisfying 

Condition 3.8. ( 1 ) The horizontals of B(x) are parallel to T Kk W%, 
(2) The verticals of AB(x) are parallel to 17 T^W^ +V 

Before restricting ourselves to the particular form of A given by (25), we give the following 

Proposition 3.9. Suppose A = Aq + pA\ where {A\}ij are generic functions of p satisfying (26). There 
exist (at least) oo 4 non-degenerate windows satisfying condition 3.8. More precisely, let &i,..,4 £R\ {0} 
be such parameters, the simplest 1 ^ representative matrix takes the form 

( bx 6 3 
B= 6 2 64 
nT, 33 b 3 

\ ^£4363 ^£4464 

such that det B ^ 0. Y>ki are suitable functions of a^. 

This structure allows us to simplify as much as possible the computation of the intermediary matrices 
without any loss of generality. 

It will be useful to isolate a scaling factor in the parameters 6j. For this purpose we redefine b{ =: pPbi, 
and from now on b{ are O(l) constants and p € N is a control parameter of the window size. 
As in this circumstance, the particular structure of the perturbed part of A given in (25) does not lead 
to substantial simplifications, so we prefer to bring for the moment a generic structure. In any case the 
condition D/0 points out a clear relation between splitting and condition 3.8. 

Proof. Consider a generic B = {hj}, the above mentioned tangent spaces writes 

Tx k W£ = {P = 0, p = Pk }, T K ,W£ +l = {Q = 0, p = Pk+1 }. 

Let us denote with U{ the canonical basis vectors of Q and with those of M 4 . Then conditions 3.8 and 
non-degeneracy take the form 

(1) B^ •£,- = () i = 1,2 j = 3,4, 

(2) ABui ■ ej = i = 3, 4 j = 1,4, 

7 This item is an easier to handle condition than "the verticals of ^ o B_(x) are parallel to Tx' k Wfc+i "> but the geometrical 

meaning is the same. In this way, since Tx_ k W^ and Tx_ W% +1 are transversal by virtue of the splitting, horizontals and 

verticals of the window are transversal too, implying the non-degeneration property. 
1 8 

i.e. with the lowest number of non-vanishing entries, up to an entries interchanging. 



(34) 
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(3) B invertible. 

The first condition is satisfied if 631 = 641 = 632 
linear systems 



where 



/l(/-i)&33 + ^14^43 = fJdi 

M043633 + h(jj)hz = 



642 = 0, while the second one leads to the following 



/i(a0&34 + H a u b u = l^d 2 

^43634 + / 2 (^)^44 = H d 4 



fi(/j,) := (a + na 13 ) 
/ 2 (//) := 1 + ftau 



di 
d 3 



■(ttH&13 + 012^23) 
"(041613 + O42623) 

-(an 614 + 012624) 
-(041614 + 042624) 



By setting 6(h) = fi(n)f2(H) — H %a \^ a ^> = a + 0(/x) that is non-zero for sufficiently small //, the 
above systems give 



633 
643 



,. fMa{it)-Afoi4fj3 

,, ^3/lW-^a43^i 
^ «5( M ) 



634 
644 



,, ^2/2(A t )~/ ja 14^4 
,, ^4/i(A t )-/Ja43^2 



(35) 



Note that, by the previous solution and by definition of 6(h), we get 

,2 



633644 



643634 



J£_ 
6(h) 



D(b 13 b 



24 



6236 



23^14 ) 



The simplest form of B is then achievable by choosing 

623 = 614 = 621 = 612 

in such a way 



0, 



det£ 



6(h) 



611622613624!). 



(36) 



(37) 



(38) 



Now define b\ := b\\, 62 := 622, 63 := 613 and 64 := 624- By hypothesis D ^ 0, so B is invertible. 
In conclusion, by substituting the simplified di .. 4 (by choices (37) in (35)) and defining 



^33 : — " 
S34 := - 

we get the required form. 



-an+/j(ai4a4i — an 044) 

-ai2+M a i4 a 42 — ai2Q44) 
<5( M ) 



5^43 
S44 



-aa4l+/j(ana43— 013041) 
<S( M ) 

-aa42+M a l2"43-ai3"42) ' 
<5(M) 



(39) 
□ 



4. Proof of the theorem 

By using the preparatory material of the previous section, we want to prove theorem 2.8 via WM. The 
construction is standard: we consider the evolution through of a generic window B_ k ("close" to the 
k— th torus) obtaining a window W_[ and the backward evolution through f~ n of a window B_ k+ i (close 
to the k + 1— th torus), getting Wo- Now we wonder if centres and representative matrices of B_ k k+1 
can be determined in such way W \ c.a.o. W\. The windows B_ k possess, by construction, a "shape" that 
makes this property easier to obtain. Nevertheless, the absence of an anisochrony term, implies that the 
windows, if constructed as in [Mar96], are not "deformed" in a suitable way to get correct alignment. 
This will be clear imposing the condition required by lemma 3.7, we use here as a guideline for the proof. 
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4. 1. Affine problem. Let us consider a family of affine windows of the form 
defined in the neighbourhood of X_ k 



P k + B k x. 



v n + G{n)p k+1 , 
G(n)B k+1 . 



(41) 



P k = ((T k ,0,5 k ,0) + X k , (40) 
with B k ,a k , 5k to be recursively determined. 

Now consider the actions on B_ k and B_ k+1 of the linearized maps and / n respectively. Our purpose 
is to determine the windows centres and representative matrices in a way to satisfy the alignment test. 
We get 

Si = Xl k + A(a k ,0,5 k ,0) T , c 2 : 

Wi = AB k , W 2 

Where G(n) = diag(L~ n , 1, L n , 1) and v n = (0, -nv, 0, 0). 
Recalling (25) then using (34) and (39) the product AB k reads 

/ /xan&i /U012&2 
6 2 

Koifijh /J,a 32 b 2 Ki(n)b 3 

\ l^dAih lia&bi 

where 

#oG") : = -l/a + /ia 3 i 

( u \ .— a+M(ai3-Q3ia 2 )+A' 2a ( a li a 33-ai3a3i) 
IvrV ' a(a+/Ltoi3 ) ■ 

«32«— ^("12^33— Q13Q32) 
a+/^ai3 



AS* 





[iK 2 {\i)b± 




\ 



/ 



(42) 



4.1.1. Alignment test part one: intermediary matrices. By using the obtained simplifications, the inter- 
mediary matrices read as 



M k 





K (a*)6i 
\ A*a4i6i 



^ai 2 b 2 

b 2 
Ha 32 b 2 
fia i2 b 2 



and 



/ -ciZ- n 









-C2 






-c 3 L~ n 


-^^43C3 








\ 

-c 4 

-//L rl S34C4 

-/1S44C4 y 





64 

if 2 (A^4 




where 



b 



k+l 



(44) 



so we have denoted by Cj the entries of B k+ \. 

As required by lemma 3.7 we have to compute the inverse of M k , however its explicit expression is 
quite complicate and cumbersome. We prefer a "perturbative" approach of this computation, by setting 
L n = and suitably choosing /3. This leads to the following 

Lemma 4.1. For all p 6 N, there exists (3 P G N smc/z f/jaf, /or a/Z /3 > /3 P , f/ie matrix M is invertible and 
its inverse admits the following expression 



( 



Ml 



-p-i 



2D£ 



:-;:-! 



D-a42S33 

61 

"41 £33 

C3 
1I41S33 

C4 



P c 3 
_,,S33_D 

^ C 4 



^12^33 \ 
bl » 

«llS34 



33 



(45) 



C4 
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where D : = £33X44 — S34S43 = D/5(p). Moreover, the explicit expression of the third column is 

/ _ C112S43 ^ 



p-p-i 



2DY, 



33 



V 



11^4 



ail ^43 
c 4 



+ 0(ji 



(46) 



/ 



As P does not depend on \i this result holds at the price of a multiplicative 0(1) constant of the transition 
time n(/3) := log L pT^ = 0(log p~ l ). 



Proof. It is a straightforward check. By choosing 

&:=4(p + l) 
one gets, for sufficiently small /x and (3 > (3 P , 

detM fc ~ -2^ 4p+3 - /3 6i6 2 c 3 C4 J DS33. 



(47) 



(48) 



that is non-zero by (26). The adjugate can be computed in the same way, by using the dominance of 

P □ 



terms containing p 
By using the already obtained form of M^ 1 and by defining 

the product M^ l N k yields, up to 0(//) 

M^N k -- 





«12 


7 := 


an' 


( ° 


7fr 





C2 
b 2 





' C 3 


v° 


£2 

C4 



(49) 















7- 

'C3 




_2i 

C4 





-2* 



Now, recalling (44) and reintroducing the dependencies on k, it is easy to see that if b\ = b\ +1 for some 
k, even in the simplest case 7 = 0, one gets 

1 



\M k - X N k \ 



1 + max 



6| 6| 
&§' of 



> 1, 



and the affine alignment test fails. 

The situation is quite different in the anisochronous case, in which the 9 component of the map / 
writes as (see [Mar96]) 

f^ n {e, P ) = e-{u + u xP )n. 

In this way, one gets an estimate of the form 

\\M^ x N k \ 



< 



\av\n 



21' 



(see [Mar96, Pag. 248]), where a = 0{p) and n is kept sufficiently large. 

Due to the presence of the term v\, the quantity \\Mj~ iVfc||oo tends to zero, as n increases, and the 
alignment test can be satisfied. 

This looks as the consequence of the "transversality-torsion" mechanism pointed out in [CG01]: the 
joint action of splitting (a ^ 0) and torsion (ui ^ 0) transforms the partial hyperbolicity (i.e. in the two 
variables (P, Q)), in total hyperbolicity (in the four variables). In this way windows are compressed- 
stretched also in the (9,p) variables. This feature, already mentioned and used in [Eas81], is clearly 
stressed in lemma 4.3 of the paper [Cre03]. In such result, eigenvalues of the map C : Q — > Q defined 
by 

£:= 2T 1 o f n o$oB 
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are explicitly computed, showing that for sufficiently small fi and large n, these are real and of modulus 
not equal to one. Consistently if av\ = 0, the modulus of the eigenvalues corresponding to directions 
(9, p) is equal to one and the phenomenon vanishes. 

So the presence of torsion looks as an advantageous property in the WM machinery we cannot use. 
Nevertheless, by a careful choice of the representative matrices, as depending on k, it is possible to 
obtain a sort of "simulation" of the torsion effect: 

Lemma 4.2 (simulated torsion). Let N = 0(1/ n). There exist an 0(1) positive constant K and a 
sequence {Bj,}k=i,...,N € M(4, 4) of representative matrices of the form (34) such that the resulting 19 
W\p give rise to intermediary matrices satisfying 

WM^NkW^Kl-Kp 2 . (50) 

for all k = 1,...,N. 

As it is evident, this result gives a thin (but necessary) layer in order to satisfy the next estimates. 

Proof. Suppose 7^0, the simpler case 7 = is an easy consequence. 
Condition HM^iVfc Hoc, < 1 gives the following system of recursive inequalities 



(51) 



redefining b\ := \b\ |, then the solution makes sense only if b k > for all k. 

Now note that as neither b k+1 nor b k appear in the previous system, the choice of b k and of 63 is free. 
Hence, the first and the third equations are trivially satisfied. In this way the previous system is satisfied 
(by a suitable choice of b \ and 63) for all k if the following system 

b^ + bh = Cb% 



uk+1 
°2 


+ b k 


< 


hn 


uk+1 
°2 


+ b k 


< 


2b\ 


uk+1 
°2 


+ b k 


< 


M 3 


uk+1 
°2 


+ b\ 


< 


2b k+1 



J 2 ' U A — K - JU 2 

b k 2 +1 + b\ = Cb k 4 +1 

is, with C < 2. By setting Xk ■= b k , the latter gives the following one dimensional discrete initial value 
problem 

x k+2 = Cx k+1 -x k , k>l 
x 2 = b\ 
x\ = b\ 

Fixed N, we are interested in a solution such that x k > for all k = 1, . . . , N. 
If C < 2 the solution takes the form 

x k = K\ cos(/c — \)a + K2 sin(fe — l)a, k > 1, (52) 
where a = arctan ^ A ~ c ^ . 

By choosing X2 = (C/2)x± we get K2 = 0. Moreover, by setting C := \J A. — p?b 2 where b has to be 
determined, we have, a = p\ + 0(/j, 3 ). In this way (52) reduces, up to higher orders, to 

x^ = x\ cos ( /c/i^ j (53) 



for all k = 1, . . . , N. So x k > for all k = 1, . . . , N if kp\ G (0, vr/2), that is 

b 



2 

TT 



in this computation the centres p fe does not appear and then these will be determined later. 
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Now come back to the original set. Recalling the definition of the sequences b\ in terms of x k we see 
that the remaining two inequalities of (5 1) are satisfied by taking for all k 

(we have used that 2/0 > 1) for all k (63 is arbitrary). 

By keeping in mind that xi := b\ is arbitrary, the remaining initial conditions are determined. The datum 
b\ can be chosen as b\ = (C /2)b\. 

Note that = 2- \p 2 b 2 + 0(/U 4 ), so, by defining K := \b 2 + 0(/i 2 ), we see that b has to be an 0(1) 
constant, otherwise estimate (50) and approximation (53) do not properly make sense. This implies that 
N = 0(1/ p), a fully compatible choice with our purposes. 

□ 

Remark 4.3. Note that variable parameters 6 2 4 are involved in the (6, p) part of the windows, while 
entries b\^, relative to the hyperbolic part can be chosen as constant during the evolution. In this sense 
we simulate the torsion effect. 

Remark 4.4. The decay of 6 24 implies that the determinant of B k , and then those of M k , approach 
to zero. Obviously, by hypothesis, zero is never attained but the entries b\ 4 reach values which are no 
longer O(l). The smallest value of these entries (we get it for k = N) is immediate by (53) 

b" = b\ cos ( -J^—] ~ b\— = O(p). (54) 
2 2 \2N + l) 2 2N yh; 

In such a way (48) changes as follows 

detM k \ k=N ~ -2^-P h N-i h N-i h N h N D ^ (55) 

As an uniform estimates is handier, we extent condition (47) to 

pp := 6 + 4p, (56) 

for all k. 

4.1.2. Alignment test part two: estimate of the term M k ~ 1 (c 2 —Qi). Our aim is now to determine centres 

p k in such a way the alignment test is satisfied for all k. 

Set A := (c 2 — Oy). By (50), for a successful test we need, for all k 

II^AHoo < \kii 2 . (57) 

This means that centres of windows W \ and W % should coincide "as much as possible". For this purpose 
we shall use the EEC mechanism in a very profitable way. 

Lemma 4.5. Let us consider the sequence B k constructed before. For all ESC there exist an EEC, 
sequences (o~k,$k) £ I^ 2 an d n k+i £ K in such a way the windows W\ 2 (obtained by B_ k and now 
completed by (40)) are correctly aligned for all k. 

Furthermore, for all k = 1, . . . , N — 1, the transition time from T k to T k +i satisfies 

n k+1 = 0\\n^\. (58) 

The previous formula, as N = 0(/i -1 ), gives immediately the transition time estimate (9). 

Proof. We have to choose parameters 5 k and a k in order to satisfy (57) for all k. It will be simpler to 

keep 5 k+ \ and cifc+i fixed and then determine 5 k , a k . 

Keeping in mind (41), the two windows centres take the following form 

c x = (fxcTkan + fi(p)5 k , 8' k , P k + fia 33 5 k + K (fj,)a k , p k+ i + pa^(j k + pa i3 S k ) 
£2 = (^(Qk+i +o- k+ i),9 k+1 -nv,n-PS k+ i,p k+ {) 
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In order to satisfy (57), we try to nullify as much components of A as possible. This is achievable with 
first and fourth component via a suitable choice of 5 k and a k . More precisely, if 

• an + 0, we choose a k = -^043/041 and then 5 k = 3^^^^ ■ 

• 041 = 0, we take 5 k = and then a k = fi^~ 1 (Q k+ i + Cfc+i)/«n- 

The described property holds for all k = N — 1, . . . , 1. As free parameters, we choose Sn = &n = 0. 
Note that 5 k can be either zero or 0(/i /3 ). This implies that 

A 3 = 0(1). 

Moreover, if /3 is sufficiently large, the third column of M^ 1 is arbitrarily small and the contribution of 
the product Mr 1 A3 is not greater than 0(/i 3 ( p+1 )) and then negligible for suitable p. 
Now we are going to use elasticity of the transition chain in order to nullify A2. For this purpose, recall 
the notational setting of sec. 3.3 and consider values y k +i and y k of a given ESC. By keeping y k +l fixed 
we want to move y k to a suitable y k G -Ex- 
Note that for all y € E k , the argument we have used to get Ai = A4 = can be repeated simply by 
replacing the coordinates of X_ k ,]C k with those of X k (y),X ' k (y). This is possible as the elements of 
the family k + 1 do not change and the algorithm remains well posed. Now 5 k , cr k looks as functions 
8k(y),Vk(y)- 

Define 2e k := diami^fc and 

°k : = Wv) : V e (yk-e k ,y k }} 

0+ := {6> k (y): y G [y k , y k + e k )} " 

In the time n(/3) the section Sjv is visited by points whose reciprocal distance is not greater than 
0((n(/3))~^) (see e.g. [Gal99] and references therein). As diamO^ = 0(1) there exist subsequences 



n± with n± of the same order of n((3), in a way that 



e k+1 - <v € of. 

Suppose that n k+ \ := n™ for some chosen fh and define 

Hv) ■= i°'k(y) ~ (°k+i - n k+l v)\ mod 2vr = A 2 . (59) 
From (8) we deduce that T{y) is continuous and ^-F{y) = 0(// _1 ) on E k , moreover F{y k )F{y k + 
e k ) < 0, so there exists unique 20 y k E E k such that F(y k ) = 0. 

In such a way HM^T 1 A ||oo ~ and (57) is trivially satisfied. □ 

Remark 4.6. The elasticity of the transition chain could seems as an unnecessary tool to achieve affme 
alignment. Without using it we could obtain 

W^Moo < ^ P Jk^2, (60) 

where J k = max{i, A, -^j, -t+t}- Clearly, due to the decreasing of the sequences b\ A , the greatest 

value of Jfc is Jjy = 0(/i _1 ). In order to satisfy (57), A2 should be of an order not greater than /i 3+p , 
requiring in this way the following ergodization time 

n = 0(M" (p+3) ). 

As it will be clear by looking at the estimates of the next section, during such (long) time the remainder 
of f~ n is not suitably bounded and the affme problem is not still a good approximation of the complete 
one. 

Remark 4.7. The use of the EEC allows us to bypass the ergodization time on S^r so the drift time is 
independent on the Diophantine constants, as in [BB02]. 
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note that this point can be constructively determined with an arbitrary precision. 
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4.2. Estimate of the remainders. Recall lemma 3.7. By construction, functions W\(x) and W2 (x) are 
given by remainders (16) and (24) respectively, evaluated at £ <— (a/,, 0, 8 k , 0) + B k x, we denote with 

£(x),rre(-2,2) 4 . 



Furthermore, set 7^ := \\M, 



Xl 

Y2 



In this way 



oo and define 

sup^ 6W [ 1 1 M~ 1 R k+ 1 ( x ) 1 1 oo + 70 1 1 R k (x )J I oo ] 
suPxew [ 1 1 M k 1 %Rfc+i (5) 1 1 00 + 7o 1 1 dxBk (m) 

X c < max{xi,X2}- 



(61) 



First of all, it is easy to see from (24) that 

Halloo < C 2 U(x)\\l, H^Rfc^lloo < Calie^lUII^^IU, 

where 62,3 < +00 by regular (analytic) dependence on initial data over finite time. Moreover, by a 
comparison between (45) and (54), we have max^ 7^. = 0(^ l ~ p ). So we get 



maxj k \\R k (x) 

k 

uniformly in x. 

By looking at formula (16), we obtain 



maxj k \\dxR k (x) 

k 



max k \\M k R(x 



-X-p 



( 



0(n fc+ i^" 3 ) 



m 2 i/J. _± 
m 3 i 
\ m 4 i/j, _1 



777-43^ 



recalling (56) and (58). Similarly 



max 1 1 M, 1 dxR k (x 



0(n k+1 fi 



p-3\ 





/3+2p 



n/j, 







/ 



Keeping in mind (50), in order to satisfy (33b) (and then (33a)), it is sufficient to choose p = 6 (from 
which /3 > 30). As lim^o n k+if 1 = by definition, we have xi,2 = o(/j 2 ) and the proof is complete. 
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